A classic result in discrepancy theory is the theorem of Roth [2] stating
that if the elements of $\{0, 1, 2, \ldots, n\}$ are 2-coloured, there exists an
arithmetic progression $\{a, a+d, \ldots, a+(k-1)d\}$ of discrepancy at least
$(1/20)n^{1/4}$, with $0 \le a < d \le \sqrt{6n}$. In 1996, Matoušek and Spencer [8],
building upon results by Sárközy (see [3]) and Beck [5], showed that apart
from constants, this result is the best possible.

The situation is quite different, however, for homogeneous arithmetic
progressions (HAPs), the subfamily of arithmetic progressions containing 0.
It turns out that there are extremely balanced colourings for such
arithmetic progressions. Consider the following examples.

$$\chi_3(3k+1) = 1; \quad \chi_3(3k+2) = -1; \quad \chi_3(3k) = \chi_3(k)$$

$$\chi_3^*(3k+1) = 1; \quad \chi_3^*(3k+2) = -1; \quad \chi_3^*(3k) = 0$$

It is easy to show that all HAPs contained in $\{0, 1, \ldots, n\}$ have
discrepancy $O(\log n)$ under $\chi_3$ and discrepancy at most 1 under $\chi_3^*$.
(Of course, $\chi_3^*$ is a 3-colouring, and if we are going to allow 0 we might
as well colour everything 0, but let us look the other way and at the bright
side for a moment.) In general, for any prime $p$, we can define a colouring
$\chi_p^*$ via the non-trivial real character modulo $p$ (the Legendre symbol).
The discrepancy of every HAP is bounded by $(p-1)/2$ under this colouring.
Also note that $\chi_p^*$ is "almost admissible" for large $p$, since only a small
fraction of numbers is coloured 0. Whether there is a "completely admissible"
colouring of bounded discrepancy for HAPs is a question raised by Erdős in
the 1930s, and one that remains unsolved to this day. It is indeed a mishap
that this innocent-looking question should turn out to be so difficult.
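The claims about the example colourings can be checked numerically. The following is a minimal sketch, not part of the original argument: the function names are hypothetical, and it assumes that a HAP is taken as $\{d, 2d, \ldots, kd\}$ with the element 0 left out, since the recursive rule for the first colouring does not assign a value to 0.

```python
def chi3(n: int) -> int:
    """First example colouring: strip all factors of 3, then colour by n mod 3."""
    if n <= 0:
        raise ValueError("chi3 is only defined here for positive integers")
    while n % 3 == 0:
        n //= 3
    return 1 if n % 3 == 1 else -1


def chi3_star(n: int) -> int:
    """Second example colouring: multiples of 3 receive the value 0."""
    return (0, 1, -1)[n % 3]


def legendre(n: int, p: int) -> int:
    """Legendre symbol (n|p) for an odd prime p, via Euler's criterion."""
    r = pow(n, (p - 1) // 2, p)
    return -1 if r == p - 1 else r


def max_hap_discrepancy(colour, n: int) -> int:
    """Largest |colour(d) + colour(2d) + ... + colour(kd)| over all d, k with kd <= n."""
    worst = 0
    for d in range(1, n + 1):
        partial = 0
        for multiple in range(d, n + 1, d):
            partial += colour(multiple)
            worst = max(worst, abs(partial))
    return worst


if __name__ == "__main__":
    n = 1000
    print("chi3:", max_hap_discrepancy(chi3, n))
    print("chi3_star:", max_hap_discrepancy(chi3_star, n))
    print("Legendre mod 7:", max_hap_discrepancy(lambda k: legendre(k, 7), n))
```

The brute-force double loop costs roughly $n \ln n$ colour evaluations, which is fine for experimentation but is only meant as an illustration.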

Yet homogeneous arithmetic progressions are tiny herrings on the tip of 
the iceberg of quasi-progressions. Perhaps a definition is in order. A
quasi-progression $Q(\alpha; s, t)$ is the sequence of integers

$$\lfloor s\alpha \rfloor, \lfloor (s+1)\alpha \rfloor, \ldots, \lfloor t\alpha \rfloor.$$

In other words, a quasi-progression is a sequence of successive multiples
of a real number, with each multiple rounded down to the nearest integer.
Since distinct real numbers yield distinct quasi-progressions, we are dealing
with an uncountable family of sequences. Note that for integer values of $\alpha$,
quasi-progressions reduce to HAPs, or the set-difference of two HAPs. Thus
the problem raised by Erdős concerns a subfamily of quasi-progressions,
corresponding to integer values of $\alpha$.
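The definition translates directly into code. The sketch below is illustrative only; the names `quasi_progression` and `discrepancy` are hypothetical, and exact rationals are used for $\alpha$ so that the floor is not affected by floating-point rounding.

```python
from fractions import Fraction
from math import floor


def quasi_progression(alpha, s: int, t: int) -> list[int]:
    """Return [floor(s*alpha), floor((s+1)*alpha), ..., floor(t*alpha)]."""
    return [floor(k * alpha) for k in range(s, t + 1)]


def discrepancy(colour, elements) -> int:
    """Absolute value of the sum of the colours of the given elements."""
    return abs(sum(colour(x) for x in elements))


if __name__ == "__main__":
    print(quasi_progression(Fraction(3, 2), 1, 4))
    # An integer alpha gives a run of consecutive multiples of alpha.
    print(quasi_progression(3, 2, 5))
```

With an integer $\alpha$ the output is a run of consecutive multiples of $\alpha$, which is the set-difference of two HAPs mentioned above.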

Our first theorem gives a lower bound on the discrepancy of the family 
of all quasi-progressions contained in {0,1,..., n}. 

Theorem 1 If the integers from 0 to $n$ are 2-coloured, there exists $\alpha > 1$
and integers $s$ and $t$ such that the quasi-progression $Q(\alpha; s, t)$ has
discrepancy at least $(1/50)n^{1/6}$.

Proof Let $m < n$. The value of $m$ will be specified at the end of
the proof. By Roth's theorem, there exists an arithmetic progression
$P_1 = \{a, a+d, a+2d, \ldots\}$ contained in $\{0, 1, \ldots, m\}$, with
$2 \le d \le \sqrt{6m}$ and $0 \le a < d$, of discrepancy at least $(1/40)m^{1/4}$.
Let $P_2 = (n-m) + P_1$. We will show that for suitably chosen $m$, $P_2$ can be
realised as a quasi-progression corresponding to a real number $\alpha > 1$.

Observe that if $\alpha = d - \varepsilon$, the first $\lfloor 1/\varepsilon \rfloor$ elements in the sequence
$\lfloor \alpha \rfloor, \lfloor 2\alpha \rfloor, \lfloor 3\alpha \rfloor, \ldots$ are congruent to $-1 \pmod d$, the next
$(\lfloor 2/\varepsilon \rfloor - \lfloor 1/\varepsilon \rfloor)$ elements are congruent to $-2 \pmod d$, and so on. In
particular, the arithmetic progression $P_2 \equiv -(d-a) \pmod d$ can be realised
as a quasi-progression by choosing $\varepsilon$ such that $P_2$ is completely contained in
the $(d-a)$th block of length $(1/\varepsilon) + O(1)$.
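The block structure described above is easy to observe experimentally. The sketch below is an illustration under stated assumptions: `residue_blocks` is a hypothetical helper, and the shift is passed as an exact `Fraction` so that the floors are computed without rounding error.

```python
from fractions import Fraction
from itertools import groupby
from math import floor


def residue_blocks(d: int, eps: Fraction, count: int) -> list[tuple[int, int]]:
    """Run-length encode floor(j * (d - eps)) mod d for j = 1, ..., count.

    Returns (residue, run_length) pairs in order of appearance.
    """
    alpha = d - eps
    residues = [floor(j * alpha) % d for j in range(1, count + 1)]
    return [(r, len(list(run))) for r, run in groupby(residues)]


if __name__ == "__main__":
    # d = 5, eps = 1/10: blocks of length 10 with residues 4, 3, 2, i.e. -1, -2, -3 mod 5.
    print(residue_blocks(5, Fraction(1, 10), 30))
    # d = 7, eps = 3/10: block lengths follow floor(k/eps) - floor((k-1)/eps).
    print(residue_blocks(7, Fraction(3, 10), 10))
```

When $1/\varepsilon$ is not an integer the block lengths vary by one, which is the $O(1)$ term in the block length.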

Since $P_2 \subseteq \{n-m, n-m+1, \ldots, n\}$, it suffices to choose $\varepsilon$ such that
$(d-a-1)\lfloor 1/\varepsilon \rfloor < n-m$ and $(d-a)\lfloor 1/\varepsilon \rfloor > n$. Such an $\varepsilon$ exists if

$$\frac{n-m}{d-a-1} - \frac{n}{d-a} > 1.$$

Note that $d - a \le d \le \sqrt{6m}$. Therefore, we can choose
$m = \lfloor 6^{-1/3} n^{2/3} \rfloor$. This yields a quasi-progression of discrepancy at
least $(1/50)n^{1/6}$. □
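The arithmetic behind the final constant can be sanity-checked numerically. The sketch below assumes the bound inside $\{0, \ldots, m\}$ is $(1/40)m^{1/4}$ and that $m = \lfloor 6^{-1/3} n^{2/3} \rfloor$, as in the proof; the function name is hypothetical and floating-point arithmetic is used, so it is a plausibility check rather than a proof.

```python
from math import floor


def theorem1_parameters(n: int) -> tuple[int, float, float]:
    """Return (m, bound inside {0..m}, bound claimed for {0..n})."""
    m = floor(6 ** (-1 / 3) * n ** (2 / 3))
    inner_bound = m ** 0.25 / 40    # discrepancy guaranteed inside {0, ..., m}
    target = n ** (1 / 6) / 50      # discrepancy claimed in Theorem 1
    return m, inner_bound, target


if __name__ == "__main__":
    for n in (10**3, 10**6, 10**9):
        m, inner_bound, target = theorem1_parameters(n)
        print(n, m, inner_bound >= target)
```

Asymptotically the ratio of the two bounds is $(50/40) \cdot 6^{-1/12} \approx 1.08$, so there is a little slack in the constant $1/50$.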



While it is not known whether the set of homogeneous arithmetic
progressions has bounded discrepancy, there exist colourings (see [9]) for which
the arithmetic progression $\{0, d, 2d, \ldots\}$ has discrepancy at most $d^{8+o(1)}$
for all $d$. It turns out, however, that upper bounds independent of $n$ do not
exist for most quasi-progressions.

Let $\alpha > 1$ be given, together with a 2-colouring of $\{0, 1, \ldots, n\}$. Let
$D_\alpha(n)$ denote the maximum discrepancy of $Q(\alpha; s, t)$ over all admissible $s$
and $t$. In 1986, Beck [6] showed that given any 2-colouring of the
non-negative integers, for almost every $\alpha \in [1, \infty)$, there are infinitely
many $n$ such that $D_\alpha(n) > \log^* n$. Recall that $\log^* x$ denotes the inverse of
the tower function: $\log^* x = \ln x$ for $1 \le x \le e$ and $\log^*(e^x) = 1 + \log^* x$.
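To get a feeling for how slowly this function grows, the recursive definition can be evaluated directly. This is a small sketch with a hypothetical function name; it unrolls the recursion $\log^* y = 1 + \log^*(\ln y)$ for $y > e$.

```python
import math


def log_star(x: float) -> float:
    """Inverse tower function: ln x on [1, e], and 1 + log_star(ln x) for x > e."""
    if x < 1:
        raise ValueError("log_star is defined here only for x >= 1")
    steps = 0
    while x > math.e:
        x = math.log(x)
        steps += 1
    return steps + math.log(x)


if __name__ == "__main__":
    for value in (1, math.e, 10**6, 10**100):
        print(value, log_star(value))
```

Even for $x = 10^{100}$ the value stays below 4, which is why a bound of order $(\log n)^{1/4}$ is a substantial improvement.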

We improve on this result, and prove the following theorem. 

Theorem 2 Let $\chi$ be a partial colouring of the non-negative integers
with density $\rho > 0$, and let $\chi_n$ denote the restriction of $\chi$ to
$\{0, 1, \ldots, n\}$. Then for almost every $\alpha \in [1, \infty)$, there are infinitely
many $n$ such that $D_\alpha(n) > (\log n)^{1/4 - o(1)}$.

Proof Let $E$ denote the set of $\alpha$ such that there are only finitely many $n$
with $D_\alpha(n) > (\log n)^{1/4 - o(1)}$ under the colouring $\chi$. If $E$ has positive
measure, there exists a positive integer $t$ for which the set of balanced $\alpha$
in $[t, t+1)$ has measure $\delta > 0$. But it follows from the Main Lemma (see
below) that there exists $c_0 = c_0(\rho, t, \delta)$ such that the set of $\alpha$ with
$D_\alpha(n) < c_0(\log n)^{1/4}$ has measure less than $\delta$. For all other $\alpha$ in
$[t, t+1)$, we have $D_\alpha(n) > (\log n)^{1/4 - o(1)}$ for sufficiently large $n$,
yielding a contradiction. □

It remains to state and prove the Main Lemma. 

Main Lemma Let $\chi$ and $\chi_n$ be as in the statement of Theorem 2. Given
$t \in [1, \infty)$ and $\delta > 0$, there exists $c_0 = c_0(\rho, t, \delta)$ such that the set
of $\alpha$ in $[t, t+1)$ with $D_\alpha(n) < c_0(\log n)^{1/4}$ under $\chi_n$ has Lebesgue
measure less than $\delta$.

Remark: We say that $\alpha$ is $M$-balanced if $D_\alpha(n) < M$. For brevity, we
shall hereafter refer to $(c_0(\log n)^{1/4})$-balanced $\alpha$ simply as "balanced".
We will transform the problem into a geometric setting, with a view to using
orthogonal functions, as was done by Roth [1] in his classic paper on the
measure-theoretic discrepancy of axis-parallel rectangles. A similar
construction was used by Hochberg [7] to show the existence of a
quasi-progression of discrepancy $c_0(\log n)^{1/4}$. As we saw in Theorem 1,
quasi-progressions with much larger discrepancy do occur.

Proof We shall assume, for the sake of convenience, that $n = (t+1)m$
where $m = 2^u$ for some positive integer $u$. We join each lattice point $(a, b)$
with the one vertically above it, and give the resulting unit segment the
colour $\chi(b)$. For each point $(x, y)$ in the plane, the discrepancy function
$D(x, y)$ is defined to be the sum of the $\chi$-values of the unit segments crossed
by the line joining $(0, 0)$ and $(x, y)$. Note that $|D(x, y)| < M$ if and only if
$y/x$ is $M$-balanced.
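The geometric discrepancy function is a quasi-progression sum in disguise: at each integer abscissa $a$ the line has height $ay/x$, so it meets the unit segment whose lower endpoint has ordinate $\lfloor ay/x \rfloor$. The sketch below illustrates this for integer endpoints. It rests on two assumptions not stated in the text: the segment at abscissa 0 is ignored, and a line passing exactly through a lattice point is counted as crossing the segment above that point. The function name is hypothetical.

```python
from fractions import Fraction
from math import floor


def line_discrepancy(colour, x: int, y: int) -> int:
    """Sum of colour(b) over unit segments (a, b)-(a, b+1) met by the line (0,0)-(x,y).

    Only abscissae a = 1, ..., x are considered; the segment met at abscissa a
    has lower endpoint (a, floor(a * y / x)).
    """
    slope = Fraction(y, x)
    return sum(colour(floor(a * slope)) for a in range(1, x + 1))


if __name__ == "__main__":
    parity = lambda b: 1 if b % 2 == 0 else -1
    # Slope 3/2 visits the ordinates 1, 3, 4, 6, i.e. floor(a * 3/2) for a = 1, ..., 4.
    print(line_discrepancy(parity, 4, 6))
```

The ordinates visited are exactly the elements $\lfloor a \cdot y/x \rfloor$ for $a = 1, \ldots, x$, that is, a quasi-progression with $\alpha = y/x$.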







Let $H(x, y) = D(x, y)$ if $y/x$ is balanced, and 0 otherwise. Suppose that
the measure of the set of balanced $\alpha$ in $[t, t+1)$ is at least $\delta$. We will
deduce a contradiction for a suitably chosen $c_0$ by producing a point
$(x_0, y_0)$ with $H(x_0, y_0) > c_0(\log n)^{1/4}$.



Let $R$ denote the region bounded by the lines $x = m/2$, $x = m$, $y = tx$,
$y = (t+1)x$. We will construct orthonormal functions $g_1, g_2, \ldots, g_r$ on $R$
where $r = (\log n)/8$ and

$$\sum_{i=1}^{r} \langle H, g_i \rangle^2 > \frac{\rho^2 \delta^{10} m^2 (\log n)^{1/2}}{2^{27} c_0^2 t^3}.$$

Since $R$ has area $3m^2/8$, it follows from Bessel's inequality that there
exists $(x_0, y_0)$ with $H(x_0, y_0) > c_0(\log n)^{1/4}$ for

$$c_0(\rho, t, \delta) = \frac{\rho^{1/2} \delta^{5/2}}{142\, t^{3/4}},$$

yielding the desired contradiction.



The functions $g_1, g_2, \ldots, g_r$ will be normalised versions of mutually
orthogonal functions $G_1, G_2, \ldots, G_r$. Following Hochberg, we will construct
$G_i$ by dividing $R$ into a grid of trapezoids, called the $i$th trapezoidal grid.
We use vertical lines spaced $\ell = 2^i$ apart and slanting lines with slopes
equally spaced between $t$ and $t+1$. The slopes of consecutive slanting lines
differ by $\tau = 1/(\ell\beta m)$ where $\beta = c_2(\log n)^{1/4}$. The value of $c_2$ will be
specified later. A trapezoid at abscissa $x$ has area about $\ell\tau x = x/(\beta m)$,
and $m/2 \le x \le m$, so it is easy to see that the individual grid trapezoids
have area at most $1/\beta$ and at least $1/(2\beta)$.




Note that we have specified only the spacing between the grid lines and
not their actual position. We choose the position of the rightmost vertical
line randomly and uniformly in the interval $[n - \ell, n)$, and the slope of the
lowermost line randomly and uniformly in the interval $[t, t + \tau)$. The region
between two consecutive sloping lines will be called a sector, and sectors will
be identified with subintervals of $[t, t+1)$ in the natural fashion. We will
denote the measure of balanced $\alpha$ in the $j$th sector of the $i$th grid by
$\mu_{ij}$. For convenience, we define $\mu^*_{ij} = \mu_{ij}/\tau$.

If $\chi(b) \ne \chi(b-1)$, we refer to $b$ as a switch value. Furthermore, a
lattice point $(a, b)$ will be called a switch point if $b$ is a switch value. A
switch point is said to be good if it finds itself alone in a trapezoid no
matter how the grid is positioned; bad otherwise. We shall denote the number
of good switch points in the $j$th sector of the $i$th grid by $s^*_{ij}$.
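Switch values are simply the places where the colouring changes. A minimal sketch, reading the condition as $\chi(b) \ne \chi(b-1)$ and using a hypothetical helper name:

```python
def switch_values(colour, lo: int, hi: int) -> list[int]:
    """Integers b in [lo, hi] at which the colour differs from that of b - 1."""
    return [b for b in range(lo, hi + 1) if colour(b) != colour(b - 1)]


if __name__ == "__main__":
    # Illustrative colouring: blocks of three consecutive integers alternate in colour.
    blocks = lambda b: 1 if (b // 3) % 2 == 0 else -1
    print(switch_values(blocks, 1, 10))
```

Few switch values in an interval mean long monochromatic runs, which already force large discrepancy; this is why the number of switch values can be bounded from below later in the proof.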

We define $G_i$ as follows: On a trapezoid containing exactly one switch
point, $G_i$ is defined in a checkerboard fashion: the trapezoid is cut into
four pieces by a vertical and a slanting dividing line, and $G_i$ takes the
values $+s$ and $-s$ on alternate pieces. On all other trapezoids, $G_i$ is
defined to be identically zero.

The vertical dividing line passes through the centre of the trapezoid. The
position of the slanting dividing line is chosen such that the measure of
balanced $\alpha$ above the line and inside the sector equals the measure of
balanced $\alpha$ below the line and inside the sector. The value of $s$ will vary
from trapezoid to trapezoid, but will always equal $+1$ or $-1$. Since the
vertical dividing lines are nested dyadically (note that the vertical spacing
is $\ell = 2^i$), it is clear that $\{G_i\}$ form an orthogonal family.



We now derive a lower bound on the inner product $\langle H, G_i \rangle$. The
position of the slanting dividing line has been chosen with a view to
extending Hochberg's argument for the $\mu^*_{ij} = 1$ case to the more general
problem at hand.

Lemma 1 $E(\langle H, G_i \rangle) \ge \sum_j (\mu^*_{ij})^2 s^*_{ij} / (32\beta)$

Proof Consider the contribution of a unit vertical segment $\ell_{a,b}$ joining
$(a, b)$ and $(a, b+1)$ to the discrepancy function $H(x, y)$. Let

$$B_{a,b} = \left\{ (x, y) : x > a,\ \frac{b}{a} < \frac{y}{x} < \frac{b+1}{a} \right\}$$

denote the set of points behind the line $\ell_{a,b}$.

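Now define

$$H_{a,b}(x, y) = \begin{cases} \chi(b) & \text{if } (x, y) \in B_{a,b} \text{ and } y/x \text{ is balanced} \\ 0 & \text{otherwise} \end{cases}$$

Clearly,

$$H(x, y) = \sum_{a=0}^{\infty} \sum_{b=0}^{\infty} H_{a,b}(x, y).$$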

Furthermore, only finitely many terms in this sum are non-zero, for any fixed
$(x, y)$. Consider a good switch point $(a, b)$ lying inside a trapezoid $T$,
located in the $j$th sector of the $i$th grid.

We claim that if neither $(a, b)$ nor $(a, b+1)$ lies inside $T$, then

$$\iint_T G_i(x, y) H_{a,b}(x, y)\, dx\, dy = 0.$$

If $T$ lies entirely outside or entirely inside $B_{a,b}$, it is clear that the
integral is zero. If exactly one of the bounding lines of $B_{a,b}$ intersects
$T$, the geometric symmetry with respect to the vertical dividing line or the
measure-theoretic symmetry with respect to the sloping dividing line, as the
case may be, ensures that there is perfect cancellation. Thus the integral
vanishes in this case as well.





Therefore, we need consider only the terms $H_{a,b}(x, y)$ and $H_{a,b-1}(x, y)$,
where $(a, b) \in T$. If $(a, b)$ is not a switch point, we have

$$\iint_T G_i(x, y) \left( H_{a,b}(x, y) + H_{a,b-1}(x, y) \right) dx\, dy = 0.$$

Now suppose that $(a, b)$ is a switch point. If $(a, b)$ lies on the
intersection of the two dividing lines, we have

$$\iint_T G_i(x, y) \left( H_{a,b}(x, y) + H_{a,b-1}(x, y) \right) dx\, dy = \frac{s}{4} (\chi(b) - \chi(b-1))\, \mu^*_{ij}\, \mathrm{area}(T).$$

We choose $s$ so that the integral is positive. Since the switch point is
good, there are no other lattice points in $T$, and the value of $s$ can now be
safely assumed fixed. Thus we get

$$\iint_T G_i(x, y) H(x, y)\, dx\, dy \ge \frac{\mu^*_{ij}}{8\beta},$$

provided $(a, b)$ lies on the intersection of the two dividing lines. Since the
location of $(a, b)$ inside the trapezoid is a uniformly distributed random
variable, we have

$$E\left( \iint_T G_i(x, y) H(x, y)\, dx\, dy \right) \ge \frac{(\mu^*_{ij})^2}{32\beta}.$$



Adding over all switch points and using the linearity of expectation, we
get

$$E(\langle H, G_i \rangle) \ge \sum_j \frac{(\mu^*_{ij})^2 s^*_{ij}}{32\beta}$$

as claimed. □

We now prove a slightly stronger version of a lemma due to Beck [6]. 

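Lemma 2 Let $J \subseteq [0, 1]$ be an arbitrary interval of length $\lambda$ and let
$1 \le b_1 < b_2 < \ldots < b_q$ be integers. Let
$N(\alpha, J) = |\{j : \{b_j\alpha\} \in J,\ 1 \le j \le q\}|$. If $q \ge \lambda^{-6}$ then
$\mu(\{\alpha \in [0, 1] : N(\alpha, J) \ge q\lambda/2\}) \ge 1 - 8/\sqrt{q}$.

Proof The proof uses LeVeque's inequality from the theory of uniform
distributions, and is almost identical to the proof of Beck's original lemma.

Let $x_j = \{b_j\alpha\}$, $1 \le j \le q$. Define $\Delta(\alpha)$ and $S_n(\alpha)$ as follows:

$$\Delta(\alpha) = \sup_{0 \le a < b \le 1} \left| \frac{1}{q} \left| \{j : x_j \in [a, b)\} \right| - (b - a) \right|, \qquad S_n(\alpha) = \frac{1}{q} \sum_{j=1}^{q} e^{2\pi i n x_j}.$$

Note that

$$\int_0^1 |S_n(\alpha)|^2\, d\alpha = \frac{1}{q^2} \int_0^1 \Big| \sum_{j=1}^{q} e^{2\pi i n b_j \alpha} \Big|^2 d\alpha = \frac{1}{q}.$$

By LeVeque's inequality,

$$\Delta^3(\alpha) \le \frac{6}{\pi^2} \sum_{n \in \mathbb{N}} \frac{|S_n(\alpha)|^2}{n^2}.$$

Therefore,

$$\int_0^1 \Delta^3(\alpha)\, d\alpha \le \frac{6}{\pi^2} \int_0^1 \Big( \sum_{n \in \mathbb{N}} \frac{|S_n(\alpha)|^2}{n^2} \Big) d\alpha = \frac{1}{q}.$$

Let $E = \{\alpha \in [0, 1) : N(\alpha, J) \ge q\lambda/2\}$ and $F = [0, 1) \setminus E$. Clearly,

$$\frac{\lambda^3 \mu(F)}{8} = \int_F \Big( \frac{\lambda}{2} \Big)^3 d\alpha \le \int_0^1 \Delta^3(\alpha)\, d\alpha.$$

Therefore, $\lambda^3 \mu(F)/8 \le 1/q$. Since $q \ge \lambda^{-6}$, we have

$$\mu(E) = 1 - \mu(F) \ge 1 - \frac{8}{\sqrt{q}},$$

proving the lemma. □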
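The counting function in the lemma can be computed directly for rational inputs. The following sketch is illustrative only: `count_in_interval` is a hypothetical name, the interval $J$ is taken to be closed, and exact rationals are used so that membership at the endpoints is decided without rounding error.

```python
from fractions import Fraction
from math import floor


def count_in_interval(alpha: Fraction, b: list[int], lo: Fraction, hi: Fraction) -> int:
    """Number of j such that the fractional part of b[j] * alpha lies in [lo, hi]."""
    count = 0
    for bj in b:
        value = bj * alpha
        frac = value - floor(value)  # fractional part, exact for Fractions
        if lo <= frac <= hi:
            count += 1
    return count


if __name__ == "__main__":
    b = list(range(1, 101))
    lam = Fraction(1, 4)
    # For a "typical" alpha one expects roughly q * lam = 25 hits in J = [0, 1/4].
    print(count_in_interval(Fraction(355, 1130), b, Fraction(0), lam))
```

The lemma asserts that, outside a set of $\alpha$ of small measure, this count is at least half of the expected value $q\lambda$.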

Let $b_1, b_2, \ldots, b_q$ be the switch values of the colouring $\chi$ in $[n/2, n]$.
Note that $q > (n\rho)/(4c_0(t+1)(\log n)^{1/4}) = (m\rho)/(4c_0(\log n)^{1/4})$. Since
switch points come in rows, it is clear that $\chi_n$ gives rise to $mq$ switch
points.

Lemma 3 $\sum_j (\mu^*_{ij})^2 s^*_{ij} > \delta^4 mq/(4096t)$, for $1 \le i \le r$.

Proof We say that a sector is rich if $\mu^*_{ij} > \delta/2$. Let $\{I_k\}_{k=1}^{L}$ be an
enumeration of the rich sectors. Since $(\delta/2)(m\ell\beta - L)\tau + L\tau > \delta$, we have
$L > (\delta/2)m\ell\beta$.

We use Lemma 2 with $J = [0, \delta/(4\ell\beta(t+1))]$, so that $\lambda = \delta/(4\ell\beta(t+1))$.
Since $r = (\log n)/8$ and $i \le r$, we have $q \ge \lambda^{-6}$. For an arbitrary
interval $I = [a_k, b_k)$, let $I'$ and $I''$ denote $[a_k, c_k)$ and $[c_k, b_k)$
respectively, where the measures of balanced $\alpha$ in $I'$ and $I''$ are equal. Let
$A = \bigcup_{k=1}^{L} I_k'$. Note that $A$ has measure at least $\delta^2/8$. Let
$B = \{\theta : 1/\theta \in A\}$. Since $A \subseteq [t, t+1) \subseteq [t, 2t)$, the measure of $B$ is
at least $\delta^2/(32t^2)$.

Let $B^* = \{\theta \in B : N(\theta, J) \ge (q\lambda)/2\}$. For sufficiently large $n$, $B^*$
has measure at least $\delta^2/(64t^2)$. Note that $\theta \in B^* \Rightarrow 1/\theta \in I_k'$ for some
$k$. Suppose $\{b_v\theta\} \in J$ for such a $\theta$. Let $a_v = \lfloor b_v\theta \rfloor$. Then we have

$$0 \le \frac{b_v}{a_v} - \frac{1}{\theta} \le \cdots \le \frac{\delta\tau}{4}.$$

It follows that $b_v/a_v \in I_k$. Thus the $k$th sector contains a switch point
of the form $(a_v, b_v)$.

Since the contribution of a single $I_k$ towards the measure of $B^*$ is at most
$\tau/t^2$, there must be at least $\delta^2 m\ell\beta/(32t^2)$ rich sectors contributing at
least $\delta^2 mq/(1024t)$ switch points between them.

We now derive an upper bound on the total number of bad switch points. 

Given a bad switch point $(a, b)$, there exists a lattice point $(a', b')$ such
that

$$|a' - a| \le \ell \quad \text{and} \quad \left| \frac{b'}{a'} - \frac{b}{a} \right| \le \tau.$$

Let $d = |a' - a|$. Note that there are $m\ell/2$ pairs $(a, d)$ with
$1 \le d \le \ell$, $m/2 \le a \le m$. For each such pair, there are $a/(\ell\beta) + O(1)$
values of $b$ that satisfy $\|bd/a\| \le 1/(\ell\beta)$. It follows that there are at
most $m^2/\beta$ lattice points which do not find themselves alone in a trapezoid
for some placement of the grid.

Since the number of bad switch points is at most

$$\frac{m^2}{\beta} = \frac{m^2}{c_2(\log n)^{1/4}} \le \frac{\delta^2 mq}{1024t}$$

for $c_2 \ge 4096c_0t/(\delta^2\rho)$, we have

$$\sum_j (\mu^*_{ij})^2 s^*_{ij} > \frac{\delta^4 mq}{4096t}$$

as required. □

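Note that

$$E(\langle H, G_i \rangle^2) \ge \left[ E(\langle H, G_i \rangle) \right]^2 \ge \Big( \sum_j \frac{(\mu^*_{ij})^2 s^*_{ij}}{32\beta} \Big)^2.$$

Furthermore, the combined area of all the grid trapezoids containing
exactly one switch point is at most $mq/\beta$. Therefore,

$$E(\langle H, g_i \rangle^2) \ge \frac{\big( \sum_j (\mu^*_{ij})^2 s^*_{ij} \big)^2}{1024\, mq\beta} \ge \frac{\rho^2 \delta^{10} m^2}{2^{24} c_0^2 t^3 (\log n)^{1/2}}.$$

By the linearity of expectation,

$$E\Big( \sum_{i=1}^{r} \langle H, g_i \rangle^2 \Big) \ge \frac{\rho^2 \delta^{10} m^2 (\log n)^{1/2}}{2^{27} c_0^2 t^3}.$$

Thus, for some placement of the grids, the resulting $g_i$ satisfy

$$\sum_{i=1}^{r} \langle H, g_i \rangle^2 \ge \frac{\rho^2 \delta^{10} m^2 (\log n)^{1/2}}{2^{27} c_0^2 t^3},$$

yielding the statement of the main lemma. □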